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Q>^ I We present the first results of the Karhunen-Loeve (KL) eigenmodes applied 

^ I to real data of the Las Campanas Redshift Survey (LCRS) to simultaneously 

measure the values of the redshift-distortion parameter, j3 = Q^^/b, the linearly 
extrapolated normalization, Ug , and the CDM shape parameter, T = Qoh. The 



6 ■ 

^ I a shape parameter P = 0.16 ± 0.10, and a linearly extrapolated normalization 

(Tg = 0.79 ± 0.08, which are consistent with a low density universe, Qq ~ 0.5. 



O 

H ' results of our numerical likelihood analysis indicate a low value of /? = 0.30 ±0.39, 



Subject headings: cosmology theory — galaxies distances and redshifts 
large-scale structure of universe — methods statistical 



1. Introduction 

The accurate measurement of cosmological parameters has been a long-standing 
challenge for cosmologists. Fortunately, the rapidly increasing size of redshift surveys is 
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moving the estimation of many of these parameters out of the shot-noise hmited regime. 
With these larger data sets, more precise measurements now depend on correspondingly 
more sophisticated methods of analysis. For example, in estimating the power spectrum of 
galaxy clustering, one of the greatest challenges is in properly accounting for the effects of 
a finite survey geometry and the effects of redshift distortions on the signal. 

The observed power spectrum is a convolution of the true power with the Fourier 
transform of the spatial window function of the survey, Pobs(fc) = / Ptruc{k' ) \W {k — k')\'^d^k' . 
One can attempt to deconvolve the true power spectrum or compare to convolved theoretical 
spectra, but in either case the survey geometry limits both the resolution and the largest 
wavelength for which an accurate measurement can be obtained. 

The standard methods for power spectrum estimation (e.g.. Park ct al. 1994; Feldman 
et al. 1994; Fisher et al. 1993) work reasonably well for data in a large, contiguous, 
three-dimensional volume, with homogeneous sampling of the galaxy distribution, and a 
weighting scheme optimized for the shot-noise dominated errors. Using these techniques, 
nearby wide-angle redshift surveys (CfA, SSRS, IRAS 1.2, QDOT) yield strong constraints 
on the power spectrum on scales approaching 100/i~^Mpc. Tegmark et al. (1998) provides a 
detailed comparison of power spectrum estimation methods in cosmology. 

Because the uncertainty in the power spectrum depends on the number of independent 
modes at a given wavelength, constraints on larger scales require deeper surveys. Due to the 
difficulty of obtaining redshifts for fainter galaxies and limited telescope time, deep redshift 
surveys typically have complex geometry, e.g., deep pencil beams or slices. Unfortunately, 
the standard methods are not efficient when applied to data in oddly-shaped and/or 
disjoint volumes, or when the sampling density of galaxies varies greatly over these regions. 
Moreover, convolution of the true power with the complex window function causes power in 
different modes to be highly coupled. In other words, plane waves do not form an optimal 
eigenbasis for expansion of the galaxy density field sampled by such surveys. Other intrinsic 
problems arise due to redshift distortions and the effects non-linear fluctuation growth. 

As a consequence, advanced methods for power spectrum estimation are needed that 
optimally weight the data in each region of the survey, taking into account our prior 
knowledge of the nature of the noise and clustering in the galaxy distribution. These 
methods must also incorporate the effects of redshift-distortions to produce unbiased and 
robust measurements. In this paper we describe a technique that can take all these effects 
into consideration, and present the flrst results apphed to real data. The method employs 
Karhunen-Loeve (KL) eigenmodes and is based on the technique outlined by Vogcley and 
Szalay (1996) (see also Hoffman 1999), merged together with the analytic results of redshift 
distortions in wide angle redshift surveys by Szalay, Matsubara and Landy (1998). This 
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analysis uses the largest publicly available redshift survey, the Las Campanas Survey, 
LCRS, (Shectman et al. 1996). 

2. Construction of the Eigenbasis 

2.1. A Short Overview of the Karhunen-Loeve Transform 

In a KL analysis, the survey data is represented as galaxy counts in a finite number of 
N cells. In practice, the data vector d is defined as 

di^nf^^'^{ci-ni), (2.1) 

where q is the observed number count of galaxies in the i-th cell, and Ui = (ci) is the 
expected number of galaxies, based on the number of fibers in the LCRS observation. The 

I/O 

factor whitens the shot noise term (see Vogeley & Szalay 1996). This vector is then 

expanded over the KL basis functions as 

cZ = $^S„*„. (2.2) 

n 

The cosmological information is contained in the amplitude and distribution of the 
coefficients 

The KL eigenmodes are uniquely determined by the following conditions: (a) 
Orthonormality, ■ — ^nm, ^-^d (b) Statistical orthogonality, {BnB^) = {B^)Snm- 
This is equivalent to the eigenvalue problem — A„*„ (Vogeley & Szalay 1996), where 

Rij = (didj) = rii^'^ny^^ij + Sij + (2.3) 

is the correlation matrix calculated for this geometry and choice of pixelization. rjij describes 
the additional noise terms arising from systematic effects, like extinction, or total number 
of fibers in a given area of the sky. 

Although there is a large degree of freedom in the choice of pixelization, it is 
advantageous to choose a pixelization with the lowest resolution appropriate to the question 
at hand to reduce computing time, which is proportional to A^^. Since this research focused 
on cosmological measurements in the linear regime, the survey volume was divided into 
cells about 15 x 15 x 40(/i~^Mpc)^ in size. The cells are elongated in the direction of the 
line-of-sight, to reduce the Finger-of-God effects. The cell boundaries are based on polar 
coordinates, closely following the original tiling of the survey. This resulted in 1440 and 
1503 cells for the Northern and Southern sets of slices in the LCRS, respectively. 
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2.2. Computation of the Correlation Matrix in Redshift Space 

Since the data resides in redshift space, the correlation matrix must be calculated 
in redshift space to decompose the signal properly. The difficulty here is in calculating 
the cell-averaged correlation function in redshift space, which is used to construct the 
correlation matrix R. Szalay, Matsubara & Landy (1998) derived an analytic expression 
for the two-point correlation function in redshift space without using the distant observer 
approximation. In this expansion, ^^^^ is given by 

i^'\r„ r,) = cooCf + co2d°^ + co,iP + ... ; 

^P{r) = 2^/ dkk'k-^Mkr)P{kl (2.4) 

where 

Coo = 1 + + - ^13^ cos' sin' 9, (2.5) 

C02 = - + cos 2eP2{i^) -1{(3-\P' + sin' 9^ sin' 9, (2.6) 

C04 = Yb^'PM - Yif^' sin' 9PM - i/3' - ^ sin' ^) sin' 9, (2.7) 

and P = flQ^/b, the usual parameter used in relating velocities to the density field, where 
b is the bias parameter. The additional coefficients in a complete expansion (cn, C13, 
C20; C22) are small enough so that they can be ignored in this analysis. The geometry 
of any two points, Vi and Vi is parameterized by r = |ri — r2|, cos(26') = fi • f"2, and 
H = cos9{ri — r2)/r. 

The cell-averaged correlation function is calculated by numerically integrating the 
above equations. This is done by an adaptive Monte-Carlo integration for adjacent pairs of 
cells, and by a second-order Taylor approximation for more distant pairs. 

iij^-^l I dvidvji^'\ri,r2) 

VjVj Jvi Jvj 

A CDM-type power spectrum with F = 0.2, cTg = 1.0, and /3 = 0.5 is used to construct the 
initial KL basis. This initial choice does not bias any subsequent results, since we adopted 
an iterative procedure for our likelihood analysis. 

After determining the eigensystem of the matrix R, the KL modes are sorted by 
descending eigenvalue. The eigenvalues closely represent the signal-to-noise ratio of each 
mode. In addition, the KL modes with large eigenvalues correspond to the larger wavelength 
fluctuations (Vogeley & Szalay 1996). For further analysis we only use the flrst M modes 



(2.8) 
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(M < N). This both reduces the necessary computations and selects modes where hnear 
theory is more apphcable. 

How do we select Ml For the essentially two-dimensional geometry of the LCRS 
survey the 3D window function in k-space is a very elongated cigar, with the major axis 
perpendicular to the plane of the survey, while the 2D window function is extremely 
compact (Landy et al. 1996). The KL modes fill the available /c-space as densely as 
possible, given the survey geometry. The long wavelength KL modes in our truncated basis 
correspond to the densely packed 2D modes, thus the cutoff wavelength is proportional to 
M"^/^, where M is the number of modes in the truncated set. A true 3D survey would 
yield better results, since the cutoff wavelength would scale as M~^/^. 

On the one hand, one would like to select as many modes as possible, because then the 
cosmic variance of the measured parameters is smaller, due to the averaging over a larger 
set of random numbers. On the other hand, including lower signal-to-noise modes not only 
brings us closer to non-linear scales but also dilutes the signal-to-noise. It is non-trivial 
to balance this issue of cosmic variance versus non-linearity especially given the complex 
geometry of the LCRS. A natural method is to inspect the sorted window functions to 
see the relevant scales of each mode to single out the inappropriate scales. However, we 
found the resulting window functions had much too complex shapes for that purpose. This 
is because the Las Campanas Survey has a complex geometry and a selection function 
which varies from field to field and consequently makes eigenmodes form complex window 
functions in terms of Fourier modes. In this paper, we choose the maximum number of 
modes M which reproduces reasonable estimates of cosmological parameters in analyzing 
mock catalogs drawn from A'"-body simulations in which true values of parameter are 
known. 



3. Likelihood Analysis 

The likelihood function (LF) is obtained from the expression 

1 



JC OC I det Cmodel | ^''^ GXp 



(3.9) 



where Cmodei is the covariance matrix computed from our theoretical model hypotheses 
for a set of parameters Ug , F), rotated to the KL basis. This matrix is very close to 
diagonal. For i, j — 1, . . . , M, 



model ) ij 



{B,B,) 



i I model 



model ^ J ) 



(3.10) 
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In practice, the correlation matrix can be expressed as a linear combination of several 
matrices, proportional to powers of (3 and a\. In this analysis, a\ is always understood as 
the linear amplitude of the fluctuation spectrum. The shape of the respective correlation 
functions only depends on F, therefore we computed the matrices for each value of F, 
but then computed their linear combinations for the various values of (3 and cxg . The 
calculations are still quite computationally intensive, and were only possible by using 
efficient numerical algorithms. Details of our numerical analysis will be described in a 
longer, more technical paper (Matsubara et al. 1999). This paper will also discuss other, 
higher- dimensional parameterizations of the power spectrum, like the use of band-power 
amplitudes. 

Our original fiducial choice of parameters determined the initial KL basis. After the 
maximum of the LP is determined, the KL basis is recomputed at that point, and the 
likelihood analysis repeated. In the subsequent section, the results are reported as both LP 
contours, and marginalized one-dimensional LP. 

4. Analysis of N-body Simulations 

The N-body simulations were kindly supplied by C. Park and are the same ones that 
have been used in earlier analyzes of the LCRS (Landy et al. 1996, Lin et al. 1996). The 
simulation is an open CDM model with h = 0.5, Qq = 0.4, and 6 = 1. The model was 
normalized so that ag = 1. Thus, in this analysis, j3 = 0.577, F = 0.2. Determination of Ug 
from the data is problematic due the nonlinear effects and the finiteness of the volume. 

The three-dimensional LP for M = 100, 150, 200 was computed in a 21^ grid in 
parameter space 11. The LP was then marginalized with respect to each parameter, f3, F, cig . 
The resulting discrete LP was fitted by Gaussian curve, in which the center and the variance 
are identified with our estimate of the parameters and its la error bars. In Figure 0, the 
resulting estimates are plotted. Experimenting showed that iterating the basis did not 
change the estimation, thus for the model data we did not iterate in this figure. 

The N-body results show that there is excellent agreement with the shape parameter 
F, fairly good agreement with j3. It is difficult to determine what fiducial value should 
be used for in the mock catalogs for comparison. The major problem arises from the 
fact that the small-scale resolution of the analysis, ~ 20/i~^Mpc, is over twice the scale of 
non-linear clustering. This makes direct analytical calculations problematic since we never 
truly sample erg. If it is assumed that the analysis here is accurate, cxg is expected to be 
under-estimated by about 15% of the true cxg. 
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The number of modes M to use in likelihood analysis mainly affects the estimation 
of the error bars. Thus we can decide from Figure |I|, which number M should be used in 
analyzing the actual LCRS data. One can see that a choice of M = 150 is reasonable for 
the parameter estimation. 

5. Discussion of Results from the LCRS 

We have calculated the LF in the 3D parameter space for both the Northern and 
Southern samples separately, then for the combined set. We used a truncated base with 
M = 150 that was determined from experience with the N-body simulations as described 
above. In Figure several sections of the three-dimensional LF and the marginalized LF 
from the actual LCRS data are shown. In Table |l], the best fit model parameters are 
summarized. The number of iterations is three. The results of the first two iterations are 
also consistent with the final estimation in Table |I|. 

(jg is consistent with expectations from simulations and indicates that true as is 
approximately one. The estimate of the shape parameter, F = 0.16 ± 0.10, is somewhat 
lower than that found in other analyzes although consistent within errors as derived from 
the simulations. For example, Feldman et al. (1994) find F = 0.20 and Landy et al. (1996) 
find F = 0.24 below 75/i^^Mpc. If 6 ^ 1, the parameter values in the Table |l| indicate a low 
value of fio ~ P^'^'^ ^ 0.5. 

Here, it should be noted the limitations of these results with respect to a CDM 
three-dimensional parameterization of the shape and amplitude of the power spectrum. 
Earlier work by Broadhurst et al. (1989) and Landy et al. (1996) have shown a perturbation 
of the power spectrum on 100/i~^Mpc scales. This 'bump' in the power spectrum cannot 
be resolved by such a parameterization and would lead to an under-estimation of F as 
the fit finds an average shape. The other models of the large-scale structure, including 
FIB or defect models, can be also studied using the present formalism with appropriate 
parameterizations of power spectrum. 

The method we have developed here can be straightforwardly applied to redshift 
data of any geometry and of any selection function. By restricting our analysis to the 
large-wavelength modes, our method does not depend much on correction for nonlinear 
effects. The error bars of the results clearly show the advantages offered by surveys of 
larger volume and more isotropic geometry, like SDSS, which would increase the number of 
independent large scale modes substantially. 
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Table 1: Best fit models from tlie projected likeliliood functions witli M = 150. Error ranges 
correspond to 1 cr of the likelihood functions. 



Sample 




r 


^8^ 


North 


0.24 ±0.42 


0.16±0.11 


0.77 ±0.09 


South 


0.36 ±0.37 


0.17 ±0.08 


0.81 ±0.09 


Combined 


0.30 ±0.39 


0.16 ±0.10 


0.79 ±0.08 
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5 10 

simulations 



Fig. 1. — The estimation of parameters in A'"-body simulations. Horizontal axis indicate 
the different realization of mock catalog. The true values in the simulations are shown by 
long-dashed horizontal lines for /3 and F. The dotted lines are for M — 100, solid hues for 
M = 150, and dashed lines for M = 200. 
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Fig. 2. — Likelihood function of LCRS data. Upper panel: Three-dimensional likelihood 
function. Four sections of fixed F are plotted as a contour map, in which la, 2a, and 
3a" confidence levels are shown. Lower panel: Marginalized likelihood functions of each 
parameter. Number of KL modes used in likelihood analysis is varied. Dotted lines are for 
M = 100, solid fines for M = 150, and dashed lines for M = 200. 



